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Abstract 

A DNA polymerase (DNAP) replicates a template DNA strand. It 
also exploits the template as the track for its own motor-like mechanical 
movement. In the polymerase mode it elongates the nascent DNA by one 
nucleotide in each step. But, whenever it commits an error by misincor- 
porting an incorrect nucleotide, it can switch to an exonuclease mode. In 
the latter mode it excises the wrong nucleotide before switching back to 
its polymerase mode. We develop a stochastic kinetic model of DNA repli- 
cation that mimics an tn-vitro experiment where a single-stranded DNA, 
subjected to a mechanical tension F, is converted to a double-stranded 
DNA by a single DNAP. The _F-dependence of the average rate of repli- 
cation, which depends on the rates of both polymerase and exonuclease 
activities of the DNAP, is in good qualitative agreement with the corre- 
sponding experimental results. We introduce 9 novel distinct conditional 
dwell times of a DNAP. Using the methods of first-passage times, we also 
derive the exact analytical expressions for the probability distributions 
of these conditional dwell times. The predicted _F-dependence of these 
distributions are, in principle, accessible to single-molecule experiments. 

1 Introduction 

A linear molecular motor is either a macromolecule or macromolecular complex 
that moves along a filamentous track [TJ [51 131 H] . In spite of its noisy stepping 
kinetics, on the average, it moves in a directed manner. Its mechanical work is 
fuelled by the input energy which, for many motors, is chemical energy. The 
distributions of the dwell times of a motor at discrete positions on its track 
as well as the duration of many complex motor-driven intracellular processes 
have been calculated [H HJ [71 |H1 [HI [13 [H] using the methods of first-passage 
times |13j . a well-known formalism in non-equilibrium statistical mechanics. 
Experimentally measured distributions of dwell times of a motor can be utilized 
to extract useful information on its kinetic scheme [TOl [E] . 

For motors which can step both forward and backward on a linear track, 
four distinct conditional dwell times can be defined; distributions of these four 
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conditional dwell times have been calculated for some motors [71 |H1 HH [H]- 
In this paper we consider a specific molecular motor called DNA polymerase 
(DNAP) and argue that its movements on the track is characterized by nine 
distinct conditional dwell times because of the coupling of its dual roles during 
its key biological function. We define these nine conditional dwell times and 
calculate their distributions analytically treating each of these as an appropriate 
first-passage time. As a byproduct of this exercise, we obtain an important 
result that characterizes one of its average properties; the predicted behaviour 
is consistent with the corresponding experimental observations reported earlier 
in the literature. The distributions of the nine conditional dwell times are new 
predictions which, we believe, can be tested by single-molecule experiments. 

Deoxyribonucleic acid (DNA) is a polynucleotide, i.e., a linear heteropoly- 
mer whose monomeric subunits are drawn from a pool of four different species of 
nucleotides, namely, A (Adenine), T (Thymine), C (Cytosine) and G (Guanine). 
In this heteropolymer the nucleotides are linked by phosphodiester bonds. The 
genetic message is chemically encoded in the sequence of the nucleotide species. 
DNA polymerase (DNAP) |15J, the enzyme that replicates DNA, carries out 
a template-directed polymerization |16| . During this processes, repetitive cy- 
cles of nucleotide selection and phosphodiester bond formation is performed to 
polymerize a DNA strand. In every elongation cycle, hydrolysis of the substrate 
molecule supplies sufficient amount of energy to the DNAP for performing its 
function. Therefore, DNAPs are also regarded as molecular motor [J O [31 13] 
that transduce chemical energy into mechanical work while translocating step- 
by-step on the template DNA strand that serves as a track for these motors. 

In an in-vitro experiment, Wuite et al. [17] applied a tension on a ssDNA. 
The two ends of this DNA fragment were connected to two dielectric beads; 
one end was held by micro-pipette, while the other end, trapped optically by a 
laser beam, was pulled. This DNA fragment also served as a template for the 
replication process carried out by a DNAP. Replication converted the ssDNA 
into a dsDNA. The average rate of replication was found to vary nonmono- 
tonically with the tension applied on the template strand [17) . Similar results 
were obtained also in the experiments carried out by Maier et al. [18], where 
magnetic tweezers were used to apply the tension on template DNA. The ob- 
served nonmonotonic variation of the average rate of replication was explained 
[T71 [T51 [TOl dOl m] as a consequence of the difference in the force-extension 
curves of ssDNA and dsDNA [^, . 

Operation of a DNAP is error-prone. But, unless corrected, replication er- 
ror would result in a defective genome for the daughter cells [?]. Therefore, 
DNAP is capable of correcting most of its own error during the ongoing repli- 
cation process itself. A DNAP performs its normal function as a polymerase by 
catalyzing the elongation of a new ssDNA molecule using another ssDNA as a 
template. However, upon committing a misincorporation of a nucleotide in the 
elongating DNA, the DNAP can detect its own error and transfer the nascent 
DNA to another site where it catalyses excision of the wrongly incorporated 
nucleotide. The distinct sites, where the polymerisation (pol) and exonuclease 
(exo) reactions are catalyzed, are separated by 3-4 nm on the same DNAP 
The nascent DNA is transferred to the pol site from the exo site after the wrong 
nucleotide is cleaved from its tip by the DNAP. Thus, the transfer of the DNA 
between the pol and exo sites couples the polymerase and exonuclease activities 
of the DNAP. 



2 



In the next section we develop a microscopic model for the replication of a 
ssDNA template that is subjected to externally applied tension F, a situation 
that is very similar to the in-vitro experiment reported in refs. |17[ I18| . The 
rates of both pol and exo activities of the DNAP enter into the expression that 
we derive for the average rate of elongation of the DNA. The F-dependence of 
this rate is consistent with the experimental observations reported in [17[ 118) . 
We then define 9 distinct conditional dwell times of the DNAP and identifying 
each of these with an appropriate first-passage time [13], we calculate their 
distributions analytically. We believe that experimental measurements of these 
distributions are likely to elucidate the nature of the interplay of the pol and 
exo activities of DNAP. 

2 Model 

The nucleotides on the template DNA are labelled sequentially by the integer 
index j {j = 1,2, which also serves to indicate the position of the DNAP 

on its track. The chemical (or conformtional) state of the DNAP is denoted by 
a discrete variable n — 1, 2..., 5). The state of the DNAP is during replication 
is described by the pair j, fj,. The kinetic scheme used for our model is adapted 
from that proposed originally by Patel et al. [25^ and subsequently utilized by 
various other groups [30] ■ The kinetic scheme of our model is shown in figure 
([T]), where the four different values 1, 2, 3 and 4 of /x are the allowed chemical 
states in the polymerase-active mode of the enzyme, while in chemical state 5 
the exonuclease catalytic site is activated. 

The structure of DNA polymerase resembles a "cupped right hand" of a 
human, where its sub domains are recognized as palm, thumb and finger sub 
domains [57^ . Template DNA enters from the finger sub-domain and takes exit 
from thumb sub-domain. The catalytic site where the binding occurs is located 
between finger and palm domain. Transitions between polymerase activated ki- 
netic states of the enzyme (i.e., chemical states 1,2, 3 and 4) can be summarized 

as [iiisn] 

EoDj+dNTP ^ EoDjdNTP ^ EaD^dNTP ^ EcDj+iPPi ^ EoDj+i (1) 

k—i k — 2 fe — 3 fe_4 

where Ec and Eq represent the closed and open finger configuration DNAP, 
respectively, while Dj denotes the length of the nascent DNA strand. 

Let us start with the state EgDj, labelled by ^ = 1, in which the finger 
domain of DNAP is open and the DNAP is located at the site j on its tem- 
plate. Now a substrate molecule (dNTP) binds with the DNAP and resulting 
state EoDjdNTP is labeled by "2" . The transition 1^-2 take place with 
rate fci, while corresponding reverse transition 2 — > 1 occurs with rate fc_i. 
Binding energy of dNTP switches the open finger configuration of DNAP into 
closed finger configuration and the corresponding transition 2 (EoDjdNTP) 

3 [EcDjdNTP] take place at the rate ^2- The reverse transition 3 — > 2 oc- 
curs at the rate k-2- This new closed finger configuration of DNAP catalyzes 
the formation of phosphodiester bond between dNTP and nascent DNA strand 
thereby elongating the nascent DNA from legth j to j + 1; this process is repre- 
sented by the transition 3 (EcDjdNTP) — 4 (EcDj+iPPi) that occurs at the 
rate {k-3 being the rate of the reverse transition). Finally, the transition 
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4(j) — > l{j + 1) completes one elongation cycle; the corresponding rates of the 
forward and reverse transitions are and fc_4, respectively. The transition 
4(j) l(j + 1) captures more than one sub-step which includes opening of the 
finger domain, release of PPi and the forward movement of the DNAP to the 
next site on the template. 




Figure 1: A pictorial depiction of 5 state kinetic model for DNA polymerase 
(see the text for a detailed explanation). 

Immediately after completing one elongation cycle, the DNAP is normally 
ready to bind with a new substrate molecule and initiate the next elongation 
cycle. However, if a wrong nucleotide is incorporated in an elongation cycle, the 
DNAP is likely to transfer the nascent DNA from the pol site to the exo site. 
This switching from pol to exo activity is represented by the transition 1 — >■ 5 
which occurs at the rate fc^; the reverse transition, without cleavage, takes place 
at the rate kp. In the exo mode the cleavage of the last incorporated nucleotide, 
at the rate kexo, effectively alters the position of the DNAP from j + I to j. 

2.1 Force dependent chemical steps 

Not all the rate constants change with the tension F applied on the template. 
Tension-dependence of those rate constants uj which are influenced by the ten- 
sion F are assumed to have the form [19 

uj{F) = c^(0)e-^^*(^)/'=-^ (2) 

where is a phenomenological parameter, feg is the Boltzmann constant, T is 
the absolute temperature and A<I>(_F) is the free energy difference between the 
corresponding initial and final states. In our model A$(F) is the difference 
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between the free energies of a single base pair in a dsDNA and that in a ssDNA. 
The details of this calculation are given in Appendix A. 

Now we hypothesize that only the following transitions are affected by the 
tension F: (I) 3 -> 4, i.e., the polymerization step, where new dNTP subunit 
is incorporated into nascent DNA chain and a single stranded nucleotide is 
converted into a double stranded DNA. Negative value of A^{F) favours the 
incorporation of substrate molecule whereas positive value of A"I>(_F) disfavours 
the transition. 

(II) 1 — ;> 5 i.e., the transfer of the nascent DNA from the pol site to the exo 
site of theDNAP. These two catalytic sites are separated by 3.5A° and a trans- 
fer of the nascent DNA between them includes major change in the DNAP 
conformation that involves a /3 hairpin [501 ISIl ISl]- Moreover, polymerase to 
exonudease switching causes local melting of the dsDNA. Depending upon the 
sign of A<I>(_F'), tension applied on the template strand may favour or disfavour 
the 1 — >■ 5 transition. 

As we show in the next section, following force dependence of k^lF) and 
kx{F) shows a good qualitative agreement with the experimental data. 

W) = fc3(0)exp(-3A$(F)/ifBr) and k^iF) = A:,(0)exp(3A$(F)/i^sT) 

(3) 



3 Results 

3.1 Force velocity curve 

In this subsection we derive the force- velocity curve for our model DNAP motor 
and compare it with those reported earlier in the literature. Let P^{j^t) be the 
probability of finding DNAP in chemical state /i, at the position j on its track, 
at time t. The probability to finding the DNA polymerase in chemical state /i, 
irrespective of its position, is 

L 

P,{t)^Y.P>^^^^^) (4) 

i=i 

where L is the total number of nucleotides in template DNA strand. Normali- 
sation of the probability imposes the condition 

5 

at all times. The time evolution of the probability P^{t) is governed by following 
equations 

"^^'^^^ - -{k. + ki+ k^i)Pi{t) + fc_iP2(t) + kiPiit) + fcpPsW (6) 



dt 

dP2{t) 
dt 

dPsjt) 
dt 



= k,P,{t) - + k2)P2it) + k^2Pz{t) (7) 

= k2P2{t) - (fc_2 + A:3)P3 W + k^:iPi{t) (8) 
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Figure 2: Velocity of DNA polymerase is plotted against the force applied on 
template strand for a few different values of dNTP concentration. The numerical 
values of the parameters used for this plot are listed in table [TJ 



"^^^^^^ - k^iPiit) + hP.it) - (fc_3 + h)P4t) (9) 



dt 

dP^{t) 



KPi{t) ~ kpP^it) (10) 



dt 

Now we solve these equations in steady state and calculate the probability 
of finding the DNA polymerase in /ith chemical state [P^*")- 

< = ^ ^ (11) 

Xi + X2 + 2:3 +2:4+2:5 

Expressions for 2:^'s are given in Appendix B. 

Now we define the average rate of polymerization Vp and the average rate of 
excision Ve as 

Vp = Pfki - Pfk^i and Ve = ke^oPf (12) 
Therefore, the average velocity of the DNAP on its track is 

V^Vp-Ve (13) 

In figure Q the average velocity of the DNAP is plotted against the tension 
applied on DNA track. Rate constants used for this plot are collected from the 
literature [25] and listed in table [1] 



Because of the i^-dependence of the form assumed in ([3]), at lower tension tran- 
sition 2 3 is rate limiting while at higer values of tension 3^-4 becomes the 
rate limiting step. Frequent poly exo switching cause the significant increase 
in the exonuclease cleaving at higher forces. Observed trend of variation of the 
average velocity is the direct consequence of the nonmonotonic behavior of the 
A$(F), shown in figure (O. 
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Rate constant 


Numerical value 


ki 


50 /iAf-ifi-i 


k-i 


1000 


k2 


300 




100 


^3(0) 


9000 


k-3 


18000 


ki 


600 s-^ 


k-i 


25 s-^ 


kM 


.2 s-' 


kp 


700 


h 


900 s-^ 



Table 1: Numerical values of the rate constants used for graphical plotting of 
some typical curves obtained from the analytical expressions derived in this 
paper. 

3.2 Distributions of dwell times and exonuclease turnover 
times 

The average velocity of a DNAP and its dependence on the tension applied on 
the corresponding template does not provide any information on the intrinsic 
fluctuations in both the pol and exo activities of these machines. Probing fluc- 
tuations in the kinetics of molecular machines have become possible because of 
the recent advances in single molecule imaging, manipulation and enzymology. 
In this section we investigate theoretically how the fluctuations in the pol and 
exo activities of a DNAP would vary with the tension applied on the template 
DNA. For this purpose we use the same kinetic model introduced in section [TJ 
that we have used in subsection 12.11 for calculating the average properties of 
DNAP. 

The variable chosen to characterize the fluctuations in replication process 
is the time of dwell of DNAP at a single nucleotide on the template, which is 
nothing but the effective duration of its stay in that location. While moving on 
the one dimensional template strand three different mechanical steps are taken 
by DNAP, which are 

(1) Forward step in the pol mode: A{j) ~^ \{j + 1). 

(2) Backward step in the pol mode: l(j + 1) — > 4(j). 

(3) Backward step (caused by cleavage) in the exo mode: 5(j + 1) — 5(j). 

If a molecular motor takes more than one type of mechanical step then the 
fluctuations in the durations of its dwell at different locations cannot be char- 
acterized by a single distribution; instead, distributions of more than one type 
of conditional dwell times can be defined [3]. So, in the context of our model 
of DNAP, three different types of mechanical step would generate nine differ- 
ent distribution of conditional dwell times. We denote the forward, backward 
and cleavage steps are by the symbols — and x, respectively. ^mn{t) is the 
conditional dwell time of the DNA polymerase when step m is followed by n, 
where the three allowed values of each of the subscripts m and n are +,—,x. 
For the convenience of calculation of the distributions '^mn{t)^ first we assume 
that the DNAP is already at the jth site on the template strand and that the 
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rate constants for all the transitions leading to this special site j are equated to 
zero. In other words, 

(1) = only for the transition 4(j — 1) — l(j) (and not for any i ^ j), 

(2) fc_4 = only for l{j + !)—!> 4(j) (and not for any i ^ j), 

(3) kexo — only for 5(j + 1) ^ 5(j) (and not for any i ^ j). 

Now appropriate initial conditions will ensure the type of previous step taken 
by DNAP. 

If {j, t) is the probability of finding the DNA polymerase in ^th chemical 
state at site j at time i, then time evolution of these probabilities are governed 
by following master equation. 

dPi{j,t) ^ ^ ^ kx)Pi{j,t) + k^^P2{],t) + fcj,P5(j,t) (14) 



dt 

dP2{3,t) 
dt 

dP3{j,t) 



dt 

dPiij.t) 
dt 

dP^ii.t) 



kiPi{j,t) - (fc_i + k2)P2{3,t) + k^2Pz{j,t) (15) 

k2P2{],t) - (fc_2 + k3)P3ij,t) + fc_3A(i,0 (16) 

= hP3{j,t) - (fc4 + k^3)Piij,t) (17) 

= k,Pi{j,t) - {kp + k,,o)P5{j,t) (18) 



dt 

These equation can be re-expressed in the following matrix form. 

|p(t) = MP(t) (19) 

Here P(t) is a column matrix, whose elements are Pi{j,t), P2{j,t), Pj,{i,t), 
Pi{j,t) andP5(j,i). And 

— (fc_4 + fci + fca;) kp 

ki -(fc-i + fca) fc_2 

M= fca -(fc-2 + fc3) 

ks -{k4 + k-3) 

kx ~{kp ~l" kexo) 

(20) 

Now introducing the Laplace transform of the probabihty of kinetic states by, 

poo 

P^.{j,s) - / PAj,t)e-''dt (21) 

Solution of equation (fT9)) in Laplace space is, 

P(j,s)-(sI-M)-ip(j,0) (22) 

Here P(j, s) is the vector of the probability of individual chemical state in 
Laplace space and P(j, 0) is the column vector of initial probabilities. 
Determinant of matrix si — M is a fifth order polynomial 

det{sl - M) = as^ + Ps^ + js^ + Ss^ + es + (; (23) 

full expressions for a, /3, 7, S, e and C in terms of the primary rate constants 
are given in Appendix C. 
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3.2.1 Calculation of , 



Following set of initial conditions guarantees that previous step taken by DNA 
polymerase is a forward step. 



Pi(j, 0) = 1 and P2(j, 0) = PsU, 0) = P^ij, 0) = PsO', 0) ^ 



(24) 



So three different distribution of dwell time, where first step is forward, are 
defined as follows: 



(25) 
(26) 
(27) 

(28) 
(29) 

as^ + /3s4 + + + es + ( ^^^^ 

Mathematical expressions for ag, oi, bo, bi, 62, &3, cq, ci, C2 and C3 are given 
in Appendix D. 

By inserting the inverse Laplace transforms of the expressions ([28|. (|29)) and 
into the equations (1^ . (^5)) and (P71) . respectively, we get 



* + - ^4(i,i)fc4|[Pi(j,0) = l,P20-,0)=P3(j,0)=P40\0)=P50\0)=0] 

*+-(^) = A(j,i)fc-4|[PiO-,o)=i,P20-,o)=P30-,o)=P4(j,o)=P5(i,o)=o] 
*+x (0 = A ( j, t) kexo I [Pi (i.o)=i,P2 (j,o)=P3 (i.o)=P4 (i,o)=P5 (i,o)=o] 
By applying the initial condition ([M)) in equation (1^^ . we get 





ao + ais 






as"' + I3s^ 


+ 75^ + Ss^ H 


- es 


+ C 


biS^ + 63 


s3 + 62s2+&i 


s + 




as^ + Ps'^ 


+ 75-^ + Ss^ H 


- es 


+ c 


C3S^ H 


h C2S^ + CiS + 


Co 





(t^5 


- a;i)(tJ5 


- UJ2){^5 - W3)(W5 - LJi) 


(&0 


- + 


62^1 - bsOjf + &4wf )fc_4 


(wi 


- W2)('^l 


- UJ3){u!i - UJ4)i0Jl - LO5) 


(^^0 


- 61^2 + 


62W2 ~ ^3"^! + 64w|)fc_4 


(W2 


- a;i)(iU2 


- a;3)(w2 - uj4)ii^2 - t^s) 


(&0 


- 61CJ3 + 


62'^^! — ^31^3 + &4Cj|)fc_4 


(W3 


- a;i)(w3 


- UJ2){i^3 - i^4)ii^3 - t^s) 


(&0 


" 61^4 + 


b2U}l - &3'^4 + &4'^4)fc-4 


(W4 


- UJl){uJi 


- a;2)(t^4 - W3)(^^4 - t^s) 


(&0 


- biUJ5 + 


62^1 - 63^1 + &4t^|)fc-4 



(ws - a;i)(w5 - W2)(w5 - a;3)(^^5 - ^4) 



(flo — aiWi)fc4 






(CJI 


- a;2)(t^i - W3)(cji - a;4)(cji 


- W5) 








(ao — aiW2)fc4 






-LJ2t 


(t^2 


- UJi){uj2 - UJ3)i^2 - W4)(W2 


- ^^5). 








(ao - aiW3)fc4 






-LJ'jt 


(^^3 


- UJi){ll>3 - UJ2){l^3 - W4)(t^3 


- ^^5). 








(ao — aia;4)fc4 






-i04t 


{ll>4 


- UJi){ll>4 - UJ2){UJ4. - W3)(W4 










(ao - aia;5)fe4 









(31) 



(32) 
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(Co - ClWi + C2UjI - C3Ujf)kexo 



(UJI 


- W2)(a;i - 




- W4)(a;i - 


^^5) 




(co - C1W2 


+ C2UJ2 - 


- C3UJ^)kexo 






- 0Ji){uj2 - 


LO'i)(uj2 


- W4)(a;2 - 






(co - C1W3 


+ C2'jji - 








- 0Jl){uj3 - 




- tJ4)(w3 - 


W5) 




(co - C1W4 


+ C2W| - 


- Czul)kexo 




{UJ4 


- wi)(a;4 - 


W2)(W4 


~ W3)(W4 - 


W5) 




(co - C1W5 


+ 02^5 - 


- Cz^Dkexo 




(t^5 


- wi)(a;5 - 




- W3)(a;5 - 


W4) 



-1^5* 



where wi, W2, 1^3, 1^4 and 0^5 are the roots of following equation 



(33) 



(34) 



the explicit expressions of a, /3, 7, ^, e and C in terms of the primary rate con- 
stants of the kinetic model are given in appendix C. The coupled nature of 
the pol and exo activities is revealed by the mixing of the corresponding rate 
constants in the expressions of 5'+,± and +x- 



3.2.2 Calculation of * 



Following initial conditions ensures that DNA polymerase has reached to site j 
by making a backward step: 



P4(j, 0) = 1 and FiO; 0) = P2(j, 0) = P^{j, 0) = P^{j, 0) - 



(35) 



So three different distributions of dwell time, where first step is backward, are 
defined as follows: 



*-+(^) - ^4(i,Ofc4|[P4(j,0) = l,PiO-,0)=P20-,0)=P3(j,0)=P50\0)=0] 

* — (^) = ^i(j>Ofc-4|[P40\o)=i,PiO\o)=P20\o)=P30-,o)=P50-,o)=o] 

^-x{t) = -P5(j, i)fcexo|[P4(j,0) = l,Pi(j,0)=P20-,0)=P3(i,0)=P50\0)=0] 

After applying the above initial condition in equation (I22p . we get 

- , . , d4^s^ + ^35^ + (128^ + dis + do 



as5 + /3s4 ^ 


f 7S^ + Ss"^ - 


h es + C 


kx 


fc_ifc_2fc-3 






f ^s^ + (Js^ - 


h es + C 




eo + eis 





(36) 

(37) 
(38) 

(39) 

(40) 
(41) 



as^ + (3s^ + 7s3 + Ss"^ + es + C 

Full expressions for do, di, d2, d^, d^, eo and ei in terms of the primary rate 
constants of the kinetic model are given in Appendix D. Inverse transform of 
equation ([5^ . (HH)) and (HT|) gives the mathematical expression for P4{j,t), 
Pi{j,t) andP5(j,i). 
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Substituting the inverse Laplace transforms of ([5^ . (|40p and (pij) into the 
equations (155)) . ([57)) and (|55)) respectively, we get the following distributions of 
the conditional dwell time: 







d2'-o\ — d^LOi + d4U!f)k4 


[UJl - 






- W5) 


{do - 


- (iiW2 + 


^2^1 — dawf + d4w|)fc4 


(W2 - 


Wl)(W2 - 


- W3)(W2 - W4)(W2 


- W5) 


{do ~ 


~ diW3 + 


d2'^3 — dsUJ^ + c?4a;|)fc4 


(a;3 - 


a;i)(w3 - 


- a;2)(w3 - W4)(W3 


- W5) 


(do - 


- d\L04 + 


C?2'^4 — C^a'^l + C?4W4)fc4 


(W4 - 


Wl)(w4 - 


- a;2)(w4 - W3)(t^4 


- W5) 


(do- 


~ diW5 + 


^2^1 - dst^i + C?4w|)fc4 


(W5 - 


Wi)(w5 - 




LU4 ) 






_lfc_2fc-3fc-4 




(wi - 


W2)(wi - 


- W3)(a;i - W4)(a;i 


- W5) 




kxk 


_lA:_2fc_3fc_4 




(^2 - 




- W3)(a;2 - W4)(t^2 


- ^^5) 




k^k 


_l/c_2fc-3fc-4 








- '^2)(W3 - W4)(u;3 


- W5) 




kxk 


_lA:_2fc_3fc_4 




(W4 - 


UJl){u!i - 


- W2)(W4 - '^3)(t^4 


- W5) 




kxk 


_iA;_2fc-3fc-4 




(W5 - 


wi)(a;5 - 


- W2)(a;5 - W3)(a;5 


- a;4) 




(eo 


- eia;i)fcea;o 




(Wl - 


a;2)(wi - 


- a;3)(i:Ji - W4)(wi 






(eo 


- eiW2)fceKO 






UJi){l02 - 


- W3)(cj2 - W4)(a;2 


- W5) 




(eo 


- 61013)^62:0 




(W3 - 


a;i)(w3 - 


- a;2)(t^3 - W4)(a;3 


- W5) 




(eo 


- eiU}4)kexo 




(W4 - 


wi)(i:j4 - 


- 02) (W4 - 03) (04 


- ^^5) 




(eo 


- eia;5)fce:Eo 




(t^5 - 


Wi)(w5 - 


- 02) (^^5 - W3)(a;5 


- W4) 



-UJ2t 



-UJ^t 



(42) 



(43) 



e-'^s* (44) 

where wi, W2, 1^3, 1^4 and uj^ are the roots of the equation p4p . 
3.2.3 Calculation of v]/^^, v]/^ , vp^^ 

Now we consider the case where DNA polymerase has arrived at site i by making 
an exonuclease cleavage. The initial condition 



P5(j, 0) = 1 and Pi(j, 0) = P2(j, 0) = P3(i, 0) = Piij, 0) = 



(45) 
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ensures that previous mechanical step is an exonuclease cleaving. Now we define 
following distributions of conditional dwell time 



- ^^4(j,i)fc4|[P5(j,0) = l,Pi(j,0)=P20\0)=P30\0)=P4(j,0)=0] 

= Pi{j,t)k-4\[Pi,{j,o)=i,PiU,a)=P2U,o)=P3{j,o)=Pi{jfi)=o] 

"^xxit) — Pb{i, t)kexo\[Pr,(jfi) = l,Pi(jfl)=P2(jfi)=P3(]fl)=Pi(]fl)=0] 

After applying the above initial condition in equation I22[ we get 

Piii.s) 

Pi{j,s) 



(46) 

(47) 
(48) 

(49) 
(50) 
(51) 

as^ + /3s4 + + (5s2 + es + C 

The expressions for /o, /i, /2, /a, /4, go, 9i, 92 and 33 are given in Appendix 
D. The values of P4{j,t), Pi{j,t) and P5{j,t) are obtained from the inverse 
Laplace transform of the (j49p . (l50l) and (|50t . After inserting the values of 
P4{j,t), Pi{j,t) and Psijjt) in equations pS)) . (H7)) and (|T7l) . we get the exact 
analytical expression for ^ — \-{t), 5'— +(t) and ^ — 1-(<). 



fiS^ + h 




s + 


/o 




+ 75-^ + (Js^ + 


7 s 


+ c 




kik2k^kp 






as5 + /3s4 


+ 75^ + Ss^ 4 


es 


+ c 




-32S^ + .91S + 


.90 





*xx(i) = 



ifo 


- /iWl + 


/2wf - /3W1 + fiUjf}kexo' 


(wi 


- W2)(a;i 


- W3)(wi - a;4)(wi - u;5)_ 


(/o 


- /1W2 + 


/2W| - /3a;f + /4w|)fcej;o" 


(^2 


- UJi){uj2 


- W3)(W2 - U;4)('^2 - t^s). 


ifo 


- /1W3 + 


/2t^3 - /3W| + fiUjfjkexo' 


(^3 


wi)(a;3 


- W2)(w3 - a;4)(w3 - a;5)_ 


(/o 


- + 


./2t^| - /3'^| + fi^t)kexo' 


(W4 


- UJl)iuj4 


- W2)(w4 - a;3)(w4 - a;5)_ 


ifo 


- fl^5 + 


f2^l - fsO^I, + /4w|)fcea;o" 


(^^5 


- t^l)(t^5 


- W2)(w5 - a;3)(w5 - a;4)_ 






kik2k3knkp 


(cji 


- W2)(a;i 


- W3)(a;i - W4)(wi - W5)_ 






kik2k3kikp 


(t^2 


- 0Ji){uj2 


- W3)(a;2 - t^4)('^2 - ^^5). 






^1^2 ^3 k/^kp 


(^^3 




- UJ2)iuJ3. - W4)(W3 - ^^5). 






kik2k3k4kp 


(W4 


- wi)(a;4 


- W2)(a;4 - ^3)ii^4 - W5)_ 






fclfc2^3^4fcp 



-1^3* 



-Ci;4i 



(W5 - Wi)(a;5 - I1J2)(W5 - '^3)(W5 - ^^4) 



-UJit 



-1^3* 



(52) 



(53) 
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(50 - 


+ 92(^1 


- gzujl)k- 


-4 


(UJI 


- a;2)(t^i - 




- W4)(a;i 


- W5) 




(50 - 5l'^2 


+ 92(^2 


- 9Z^2)k^ 


-4 




- UJi){ll>2 - 


UJ3) {u>2 




- W5) 




{go - giuj3 


+ 92i^l 




-4 




- Ll:i){uj3 - 


W2)(t^3 


- a;4)(a;3 


- W5) 




(.90 - 51^4 


+ .921^1 


- 93^^l)k- 


-4 


(W4 


- a;i)(w4 - 


W2)(t^4 


- W3)(W4 


- W5) 




(50 - fflWs 


+ .92^1 




-4 


(t^5 




t^2){<^5 


- ^3) {(^5 


- W4) 



-UJ4t 



(54) 



where wi, UJ2, W3, 1^4 and ^5 are the roots of the equation (I34p . 

The distributions of the conditional dwell times 4'm„, except xx, are plot- 
ted for a few typical values of the parameters in figsl3] and SI Since "^xx is 
independent of the tension F , it has not been drawn graphically. We have also 
presented our numerical data, obtained from direct computer simulation, for 
the distributions plotted in figlS] Each of these distributions is a sum of several 
exponentials. Therefore, in general, these distributions are expected to peak 
at a nonzero value of time t. However, some of the distributions in fig|3] and 
m appear as a single exponential. This single-exponential like appearance is an 
artefact of the parameters chosen for plotting these curves although, in reality, 
the full distributions are sum of several exponentials. 

An interesting feature of the distributions plotted in figslS] and |4] is a non- 
monotonic variation of the probability of the most probable conditional dwell 
times with increasing F (see, for example, 5*++ and '5 ). This trend of vari- 
ation is a consequence of the nonmonotonic variation of with F (see figE]). 



4 Summary and conclusion 

DNA replication is carried out by DNAP which operates as a molecular motor 
utilizing the template DNA strand as its track. In this paper we have presented a 
theoretical model for DNA replication that allows systematic investigation of the 
pol and exo activities as well as their coupling. More specifically, the situation 
considered here mimics an in- vitro experiment where a tension is applied on 
the template strand throughout the replication process. We have calculated the 
effect of the tension on the average speed of replication, capturing the effects of 
both the pol and exo activities of the same DNAP. Our theoretical results are 
in good qualitative agreement with the results of single molecule experiments 
reported in the literature. 

However, the intrinsic fluctuations in the pol and exo processes contain some 
additional information which cannot be extracted from average properties. Dis- 
tributions of various conditional dwell times defined here characterize these fluc- 
tuations. In contrast to all the molecular motors whose dwell time distributions 
have been investigated earlier, DNAP is characterized by 9 distinct conditional 
dwell times because of the interplay of its pol and exo activities. Each of these 
conditional dwell times can be identified with an appropriate first-passage time. 
Therefore, using the methods of calculation of the distributions of first-passage 
times, we derive exact analytical expressions for all these conditional dwell times 
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for the DNAP motor. These expressions expUcitly show the effects of the tem- 
plate tension on the nature of the distributions. We beheve that, in principle, 
these distributions can be measured in single molecule experiments |33j . But, 
to our knowledge, these have not been reported so far in the literature although 
both the pol and exo activities of the DNAP have been studied extensively [34] . 
We hope our model and results will motivate experiments to study the unex- 
plored stochastic features of the kinetics of one of the most important genetic 
processes, namely DNA replication driven by DNAP. Understanding this kinet- 
ics will throw light on the propagation of life from one generation to the next. 

Acknowledgements: This work has been supported by the Dr. Jagmohan 
Garg Chair Professorship (DC) of IIT Kanpur, Department of Biotechnology of 
the Government of India (DC) and Council of Scientific and Industrial Research 
(AS). 

Appendix A 

Here the parameters with subscripts "1" and "2" correspond to ssDNA and 
dsDNA, respectively. Let bi{F) {i = 1,2) denote the average equilibrium pro- 
jections of base pair in the direction of the applied force F. Suppose, ^i{F) 
{i = 1,2) are the corresponding free energies. Then, for a given force F, the 
free energy difference between single base-pair of dsDNA and ssDNA can be 
expressed as [22] 

A$(F) = $2(i^) ~MF) = - f MF') - bi{F'))dF' (55) 

Jo 

where the right-hand side can be evaluated if the functions hi{F) are known. 
For the freely jointed chain (FJC) model of DNA, is established. 



coth 



( 2FA, \ _ KbT_ 
\Kbt) 2FA, 



l + |^)&r" (56) 



where Ki, Ai and 6™'*^ are, respectively, the elastic modulus, the persistence 
length and the average length of a base pair in the absence of any force. 

Inserting the expression (j56p into the equation (j55p we numerically compute 
the free energy difference between single base pair of dsDNA and that of ssDNA 
for the given force F. In figure [5| we plot A$ against the tension F. The 
numerical values of the parameters that we use for this computation are given 
in the table [2] Since A$ is assumed to be independnt of F beyond / = 61 pN; 
this captures the situation encountered in the in-vitro experiment of Wuite et 
al.[I7). 

Appendix B 

Xl = 1 (57) 
ki+X3k^2 



X3 



^2^1(^4 + fc-3) + ^-3^-4(^-1 + ^2) 



(fc4 + fc_3)(fc_i + fc2)(/c_2 + ^3) - k^ahik-i + k2) - k2k^2{ki + k^s) 

(59) 

^ k_^ + X3h (go) 
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Parameter values 


^max 


.58 nm 


Umax 


.34 nm 


Ai 


.7 nm 


A2 


50 nm 


Ki 


900 pN 


K2 


1000 pN 



Table 2: Numerical values of the relevant parameters used for the computation 
of A<1> using equation (1551) and (155)) 




f(PN) 



Figure 5: Free energy difference between dsDNA and ssDNA is plotted against 
the mechanical force applied on them 
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(61) 



Appendix C 



13 = ki + k2 + h + k4 + k-i + k-2 + k-3 + k-4 + kp + + k, 



a 



1 



'exo 



(62) 
(63) 



7 = kik2 + kika + k2k3 + kik4 + k2k4 + k3k4 + kexo{ki + k2 + k3 + k4 

+ kx + k-i + k-2 + fc_3 + k-4) + kp{ki + k2 + ks + k4 + k-i + k-2 

+ fc_3 + fc_4) + kx{k2 + fcs + fc4 + k-i + fc_2 + k-z) + k-ik-2 

+ fc_ifc_3 + k-2k-3 + k-ik-4 + k-2k-4 + k-^k-i + fci(fc_2 + fc^a) 



6 = kik2k3 + kik2k4 + k-ik3k4 + k2k3k4 + k-ik-2k-3 + k-ik-2k-4 

+ k-ik-3k-4 + k-2k-3k-4 + kexo{kik2 + kikz + k2k3 + kiki + ^2^4 

+ ^3^4 + k-ik-2 + k-ik-3 + k-2k-3 + k-ik-4 + fc_2fc-4 + fc_3fc_4) 

+ kexokx{k2 + A;3 + fc4 + k-i + fc_2 + k-3) + kexoikak-i + kik-i + A;ifc_2 

+ A;4A:_2 + kik-3 + ^2^-3 + ^2^-4 + fc3A;_4 + k4k-4) + kp{kik2 + /cifcs 

+ k2k3 + kik4 + ^2^4 + ^3^4 + k-ik-2 + k-ik-3 + k-2k-3 + fc_ifc_4 

+ k-2k-4 + k-3k-4 + ksk-i + k4k-\ + k\k-2 + ^4^-2 + k\k-3 + fefc-s 

+ fc2fc_4 + fc3fc_4 + kik^^) + kx{k2k3 + ^2^4 + fc3fc4 + k^ik^3 + k-2k-3 

+ k-ik-2 + fcsfc-l + k4k-i + k4k-2 + k2k-3) + fc3fc4fc_i + fcifc4fc_2 

+ fc4fc_ifc_2 + ^1^2^-3 + fclfc-2fc-3 + fc2fc3fc_4 + k2k4k-4 + k3k4k-4 

+ k3k-ik-4 + k4k-ik-4 + k4k-2k-4 + k2k-3k-4 (65) 

e = fcifc2fc3fc4 + /;;_ifc_2fc_3fc_4 + ^6x0(^1^2^3 + fci/;;2fc4 + fcifc3fc4 + ^2^3^:4 

+ fc_ifc„2fc-3 + fc_ifc_2fc_4 + /;:_ifc_3fc_4 + /c_2fc-3^-4) + k,,xokx{k2k3 

+ k2k4 + k3k4 + ksk-i + kik^i + ^4^-2 + A;_iA;_2 + A;2A;_3 + 

+ fc_ifc_3 + fc_2fc-3) + ke,xo{k:\k4k^i + fcifc4fc_2 + A;4/c_ifc_2 + kik2k-3 

+ kik-2k-3 + k2k3k-4 + k2k4k^4 + k3k4k-4 + ksk-ik-^ + k4k-ik-4 

+ k4k-2k-4 + k2k-3k-4) + kp{kik2k3 + kik2k4 + kik3k4 + k2k3k4 

+ k-ik-2k-3 + k-ik-2k-4 + k-ik-3k-4 + k-2k-3k-4 + k3k4k-i 

+ kik4k-2 + k4k-ik-2 + fcifc2fc-3 + fcifc_2fc_3 + k2k3k-4 + k2k4k-4 

+ fc3fc4fc_4 + fc3fc_ifc_4 + k4k^2k-4 + /C2fc-3fc_4 + kik^ik^^) + kx{k2k3k4 

+ k-ik-2k-3 + k3k4k-i + k4k-ik-2) + k4k-ik-2k-3 + k3k4k-ik-4 

+ k2k3k4k-4 (66) 

C = ^6x0(^1^2^3^4 + A;2fc3fc4A;x + fc3A;4A;xfc_i + fc4A;xfc_ifc_2 + fcxA;_iA;_2A;_3 

+ fc2fc3fc4fc_4 + fc3fc4fc_ifc_4 + fc4fc_ifc_2fc_4 + fc_ i fc_2 A'-3fc_4) 

+ kp{kik2k3k4 + k2k3k4k-4 + k3k4k-ik-4 + k4k-ik-2k-4 



+ k2{k-3 + k-4) + ksik-i + k-4) + k4{k-i + k-2 + k-4) 



(64) 



+ k-xk-2k-3k-4) 



(67) 
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Appendix D 



ao = kik2k3{kexo + kp) 
ai = kik2k3 



(68) 
(69) 



bo = {kexo + kp){k2kzki + k^kik-i + kAk-ik-2 + fc_ifc_2/c_3) (70) 



hi = k2k3k4 + k^kik-i + k4k-ik-2 + k-ik-2k-3 + kexo{k2k3 + k2k4 

+ k^ki + k-ik-2 + k-2k-3 + ksk-i + ksk-i + k^k-i + k2k-5 + kik-2) 

+ kp{k2k3 + ^2^4 + ^3^4 + k-ik-2 + k-2k-3 + k-sk-i + k^k-x + k^k-i 

+ kik-2 + k2k-z) (71) 



62 = feAja + ^2^4 + fc3fc4 + (fcea:o + fcp)(fe + ^3 + ^4 + fc-l + fc-2 + fc-3) 

+ kzk-i + kik-i + k4,k-2 + k-ik-2 -\- k2k-3 + k-ik-3 + k-2k-3 (72) 



^3 = + ^3 + ^4 + kexo + kp + k-i + k-2 + k-3 (73) 

64 = 1 (74) 

Co = kx{k2k3ki + k-ik-2k-3 + ^3^4^-1 + kik-\k-2) (75) 

Cl — i^x (^2^3 + ^2^4 + ^3^4 + fcafc-i + kik-\ + A;4A;_2 + k-ik-2 + A;2fc_3 

+ fc_ifc_3 + fc_2fc_3) (76) 

C2 = kx{k2 + ^3 + ^4 + k-i + k-2 + A;-3) (77) 

C3 = kx (78) 



rfo = kexo{kik2k3 + k2k3kx + kskxk-i + kxk-ik-2 + k2k3k-4 + ksk-ik^i 
+ k-ik-2k-4) + kp{kik2k3 + k2k3k-4 + k3k-ik-i + k-ik-2k-4,) (79) 

di = kik2k3 + k^ik^2k-4 + kexo{kik2 + kik3 + k2k3 + ksk^i + kik^2 

+ k2k-4 + ksk-i + k-ik-2 + k-ik-4, + fc_2fc_4) + kexokx{k2 + ^3 

+ fc_i + A;_2) + kp{kik2 + k\k3 + ^2/13 + k3k-i + A;iA;_2 + ^2^-4 

+ fc3fc_4 + k-ik-2 + k-ik-4 + k-2k-4) + kx{k2k3 + fcsfc-i + k-ik-2) 

+ k2k3k-4 + k3k-ik-4 (80) 

d2 = kik2 + kiks + A:2^3 + kexo{ki + + ^3 + fca; + k-i + k-2 + A:_4) 
+ kp{ki + ^2 + ^3 + fc-i + k-2 + k-4) + kx{k2 + /c3 + k-i + k-2) 
+ k3k-i + kik-2 + k-ik-2 + k2k-4 + k-ik-4 + k-2k-4 + k3k-4 (81) 

^3 = ^1 + ^2 + ^3 + kexo + kp + kx + + k-2 + A;_4 (82) 
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d4 = 1 



(83) 



Co = k-ik-2k-3{kexo + kp) (84) 
ei = k-ik-2k-3 (85) 

/o = kik2k3k4 + k^ik-2k-3k-4 + kx{k2k3k4 + k3k4k-i + k4k^ik^2 
+ k-ik-2k-3) + k3k4k-ik-4 + k4k-ik-2k-4 + k2k3k4k-4 (86) 



/i = kik2k3 + kik2k4 + kik3k4 + k2k3k4 + k3k4k-i + kik4k-2 + k4k-ik-2 

+ kik2k-3 + kik-2k-3 + k-ik-2k-3 + k2k3k-4 + ^2^4^-4 + k3k4k-4 

+ k3k-ik-4 + k4k-ik-4 + k4k-2k-4 + k-\k-2k-4 + k2k-3k-4 

+ fc_ifc_3fc_4 + k-2k-3k-4 + kx{k2k3 + ^2^4 + ^3^4 + fcsfc-i + A;4fc_i 



+ A;4fc-2 + k-ik-2 + fc2fc_3 + k-ik-3 + ^-2/0-3) (87) 

/2 = fcifc2 + fcifcs + fc2fc3 + A:ifc4 + A:2fc4 + ^3^4 + fcsfc-l + ^4^-1 + fcifc_2 
+ fc4A;_2 + k-ik-2 + kik-3 + k2k-3 + k-ik-3 + fc_2fc_3 + fc2fc_4 

+ ^3^-4 + A;4A;_4 + k-ik-4 + k-2k-4 + k-3k-4 + kx{k2 + fc3 + ^4 

+ + k-2 + k-3) (88) 

f3 = ki + k2 + k3 + k4 + kx + k-i + k-2 + k-3 + k-4 (89) 
U = 1 (90) 

go = kp{k2k3ki + kskik-i + fc4fc_ifc_2 + k-ik-2k-3) (91) 

91 = kp{k2k3 + k2k4 + k3k4 + ksk-i + k4k-i + k4k-2 + k-ik-2 + k2k-3 
+ k-ik-3 + k-2k-3) (92) 

g2 = kp{k2+k3 + k4 + k-i + k-2 + k-3) (93) 
53 = kp (94) 
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